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Abstract. We propose a mathematical model of DNA self-assembly us¬ 
ing 2D tiles to form 3D nanostructures. This is the first work to combine 
studies in self-assembly and nanotechnology in 3D, just as Rothemund 
and Winfree did in the 2D case. Our model is a more precise super¬ 
set of their Tile Assembly Model that facilitates building scalable 3D 
molecules. Under our model, we present algorithms to build a hollow 
cube, which is intuitively one of the simplest 3D structures to construct. 

We also introduce five basic measures of complexity to analyze these al¬ 
gorithms. Our model and algorithmic techniques are applicable to more 
complex 2D and 3D nanostructures. 

1 Introduction 

DNA nanotechnology and DNA self-assembly are two related technologies with 
enormous potentials. 

The goal of DNA nanotechnology is to construct small objects with high pre¬ 
cision. Seeman’s visionary work in 1982 pioneered the molecular units used 
in self-assembly of such objects. More than a decade later, double-crossover 
(DX) molecules were proposed by Fu and Seeman j|] and triple-crossover (TX) 
molecules by LaBean et al. [|| as DNA self-assembly building blocks. Labora¬ 
tory efforts have been successful in generating interesting three-dimensional (3D) 
molecular structures, including the small cube of Chen and Seeman Q. However, 
these are immutable and limited in size, mainly because their fabrication is not 
based on a mathematical model that can be extended as necessary. 

In parallel to DNA nanotechnology, studies on self-assembly of DNA tiles 
have focused on using local deterministic binding rules to perform computa¬ 
tions. These rules are based on interactions between exposed DNA sequences 
on individual tiles; tiles assemble into a particular ID or 2D structure when 
in solution, encoding a computation. Winfree [ p^ fo rmulated a model for 2D 
computations using DX molecules. Winfree et al. [|ll| used ID tiles for ID com¬ 
putations and 2D constructions with DX molecules. LaBean et al. Q were the 
first to compute with TX molecules. 
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Combining these two technologies, several researchers have demonstrated 
the power of DNA self-assembly in nanostructure fabrication. Winfree et al. [ p^ 
investigated how to use self-assembly of DX molecules to build 2D lattice DNA 
crystals. Rothemund and Winfree further proposed a mathematical model 
and a complexity measure for building such 2D structures. 

A natural extension of the seminal 2D results of Winfree et al. ||^ and 
Rothemund and Winfree Q would be the creation of 3D nanostructures using 
tiling. To initiate such an extension, this paper (1) proposes a general mathe¬ 
matical model for constructing 3D structures from 2D tiles; (2) identifies a set 
of biological and algorithmic issues basic to the implementation of this model; 
and (3) provides basic computational concepts and techniques to address these 
issues. Under the model, the paper focuses on the problem of constructing a 
hollow cube, which is intuitively one of the simplest 3D structures to construct. 
We present algorithms for the problem and analyze them in terms of five basic 
measures of complexity. 

There are three natural approaches to building a hollow cube. The first ap¬ 
proach uses ID tiles to form 2D DX-type tiles as in |^], and then uses these tiles 
to construct a cube. Our paper does not fully investi gat e this possibility because 
of the incovenient shape of these molecules (see Sect. |2.l| ), but our algorithms can 
be modified to accommodate these DX-type tiles. The second approach builds a 
cube from genuine 2D tiles, which is the focus of this paper. The third approach 
is perhaps the most natural: build a cube from genuine 3D tiles. It is not yet 
clear how such 3D tiles could be created; conceivably, the cube of Chen and 
Seeman ||^ may lead to tiles of this form. This paper does not fully investigate 
this possibility, either, because this approach is algorithmically straightforward 
and similar to the 2D case. 

The basic idea of our algorithms is to use 2D tiles to form a shape on the 
plane that can fold into a box, as illustrated in Fig. 1(a) -(b). We can easily 
synthesize a set of tiles to create the intitial 2D shape. To overcome a negligible 
probability of success due to biochemical factors, we must put many copies of 
these tiles into solution at once; but we must then worry about multiple copies 
of the shape interfering with each other, preventing folding, as in Figure |^(c). 

To avoid this problem, we introduce randomization, so that different copies of 
the shape have unique sticky ends. The growth of tiles into a complete structure 
must still be deterministic (as it is based on Watson-Crick hybridization), but 
we randomize the computation input — the seed tiles from which the rest of the 
shape assembles. The edges then still relate to each other, but depend on the 
random input that is different for each shape in solution. If each input can form 
with only low probability, interference with another copy of the shape will be 
kept to a minimum. 

This raises another important issue — that of using self-assembly to com¬ 
municate information from one part of the shape to another. Since the edges 
must relate to each other and the random input, designing local rules becomes 
nontrivial. In this paper, we explore and formalize patterns used in completing 
this task. In addition, we formalize biological steps that allow a specific subset 



Fig. 1. (a) 2D planar shape that will fold into a box. Each section is formed from 
many smaller 2D DNA tiles. Edges with the same number have complementary 
sticky ends exposed so they can hybridize, (b) Folding of the shape in (a) into 
a box. Here, edges 4, 5, 6, and 7 have all hybridized. Hybridization of edges 2 
and 3, whose two complements are now in close proximity, will cause edge 1 to 
hybridize and form the complete box. (c) Multiple copies of the 2D shape in 
solution. Copies of the shape can interfere and attach infinitely without control 
as long as edges have matching sticky ends. 


of tiles to be added in an isolated period of time, thus allowing better control of 
growth. We couple this with the use of temperature to improve the probability 
of a successful construction. 

The remainder of this paper is organized as follows. Section || describes the 
model of computation, including notation for DNA tiles and definitions of com¬ 
plexity measures. Section ^ describes the algorithms in detail, and Section || 
discusses future research possibilities. 

2 Model of Computation 

In this section we formally introduce our model of self-assembly, the Generalized 
Tile Assembly Models on both the mathematical and biological level. It is an 
extension of the model presented by Rothemund and Winfree in Q . 


2.1 Molecular Units of Self-Assembly 

We begin with the biological foundation for our model. We intend to build 3D 
structures using the folding technique shown in Fig. and allow construction of 
all 2D structures possible with the Tile Assembly Model. 

Our model relies on using the molecular building block of a DNA tile. Tiles 
can naturally hybridize to form stable shapes of varying sizes, and the individual 
tiles can easily be customized and replicated (via synthesis and PCR before the 
procedure) for a specific algorithm. 































DNA tiles are small nucleotides with exposed action sites (also known as 
sticky ends of a DNA strand) consisting of a single-stranded sequence of base 
pairs. When this sequence matches a complementary sequence on an action site 
of another tile, the Watson-Crick hybridization property of DNA causes these 
two molecules to bind together, forming a larger structure. A tile can be syn¬ 
thesized in the laboratory to have specific sticky ends. Different combinations 
of sticky ends on a tile essentially yield uniquely-shaped puzzle pieces. The tiles 
will automatically hybridize when left in solution. 

Most work in self-assembly uses DX and TX molecules for tiles, but the shape 
of these molecules causes a problem for 3D construction. Since the sticky ends 
are on diagonally opposite ends (see ^ and [||), these tiles form structures with 
ragged edges when they hybridize, as in Figure ^(a). Our algorithms can easily 
be modified to use these tiles by adjusting for proper alignment before folding 
into a box. 
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Fig. 2. (a) 2D structure formed from DX molecules. The left and right sides 
cannot hybridize because they are aligned improperly; the same is true for the 
top and bottom sides, (b) Branched-molecule DNA tile from |^. (c) Snynthetic 
DNA tile derived from the structure of tRNA. 


However, we propose a simpler alternative, which is using the branched 
molecules of Seeman or a variant derived from the structure of tRNA. These 
molecules, sketched in Figures ^b) and (c), are truly 2D with sticky ends on four 
sides. The structure is stable while the sticky ends are free-floating in solution 
— so the molecules have flexibility to align properly during folding. 

Such molecules offer a natural motivation for modeling them using Wang’s 
theory of tiling , which allows us to abstract construction using these molecules 
to a symbolic level. 


2.2 Symbolic Representation of Tiles 

Definition 1. A DNA sequence of length n is an ordered sequence of base pairs 
5' — bib2 • • • b„ — 3' where the sequence has a 5-prime and 3-prime end, and 
hi ^ B — {A, T, C, G}, the set of base pairs. We will assume that if the directions 
are not explicity written, the sequence is written in the 5' —> 3' direction. 































1. The Watson-Crick complement of sequence s = 5' — bib 2 • • • b„ — 3', denoted 
s, is the sequence]^ 3' — bib 2 • • • b„ — 5', where A = T, C = G. Define s = s. 

2. The concatenation of two sequences s = si • ■ • s„ and t = ti • • ■ denoted 
s • t, or simply st, is the sequence si ■ • ■ s„ti • • ■ 

3. The subsequence from i to j of sequence s = 5' — bib 2 ■ • - b^ — 3', denoted 
s[i : j], is the sequence 5' — b^bi+i • • ■ hj-ihj — 3', where 1 < i < j < n. 

Given the above definitions, two DNA strands can hybridize if they have com¬ 
plementary sequences. Formally, s = si • • • s„ and t — ti • • ■ can hybridize if 
there exist integers h^i, hs 2 , hti, ht 2 such that s[/isi : /is 2 ] = : ht 2 \- We as¬ 

sume there are no misbindings, that is, the above condition must be met exactly 
with no errors in base-pair binding. 


Remark 2. Note that st (or (s • t)) ^ st (or s • t); rather, st = t • s. 

Definition 3. The threshold temperature for a DNA sequence is a temperature t 
in some fixed set T such that the sequence is unable to remain stably hybridized 
to its complement when the solution is at a temperature higher than t' G {t — 
e,t + e) for e > 0.^ (Heating a solution generally denatures strands, so this 
definition has strong biological foundation. The consequences and methodology 
of using temperature in designing DNA sequences for tiles is discussed in Q.) If 
s has a lower threshold temperature than t, we say s binds weaker than t. 

As with most work in DNA computing, our model uses DNA sequences to 
encode information^]— in our case, an identifier specifying what kinds of matches 
are allowed between tiles on a given side. Since there are no misbindings, these 
identifiers map uniquely to DNA sequences present on the sides of tiles that can 
bind to each other. Formally, we have the following. 

Definition 4. Let S be the set of symbols used to represent the patterns on 
the sides of our tiles. We assume S is closed under complementation, that is, if 
s G S then there exists some s' € S such that s' = s where s is the complement 
of s (the purpose of this will be clear below). Let W C IJj S® be the set of DNA 
sequences called DNA words such that the words do not interfere with each 
other or themselves (i.e., bind inappropriately). We then define the injective 
map enc : iS —> W that is the encoding of a symbol into a DNA word. This map 
obeys complementation: enc (s) = enc(s). 

^ The assumption that sequences written without directions are given 5' 3® means 

that the complement of bib 2 • ■ - b^ is b„ • • •b 2 bi, which is not standard convention 
but is technically correct. 

^ A fixed set of threshold temperatures simplifies the model and corresponds to the 
temperature parameter in Q . To compensate we allow the actual threshold temper¬ 
ature to deviate slightly from the fixed point. 

® Condon, Corn, and Marathe ^ have done work on designing good DNA sequences 
for problems like this one. 






Definition 5. A DNA tile is a 4-tuple of symbols T = {sn,se,ss,sw) such 
that Si G S and enc(si) is the exposed DNA sequence at the north, east, south, 
or west action site of the tile, for i = N,E, S, W. Given two tiles Ti and T 2 , they 
will bind if two sides have complementary symbols. Properties of hybridization, 
including threshold temperature, carry over to the hybridization of tiles. We 
make a stronger no-misbinding assumption for tiles, requiring that the sticky 
ends on the tiles match exactly and fully. 

At this stage, our model exactly matches that of Rothemund and Winfree in 
[Q, except that our tiles can “rotate”; that is, (sat, se, ss, svv) = {se, ss, sw, sat). 
This corresponds more closely to tile structure. The model, at this point, could 
require many symbols to express different tile types, and possibly an exponential 
number of DNA words. Ideally, we would like to arbitrarily extend the symbolic 
or informational content of each side of a tile. Therefore we make the following 
generalization. 

Definition 6. Let A be a set of symbols closed under complementation, and let 
17 be a set of corresponding DNA words. A k-level generalization of the model 
defines a map g : S and a corresponding encoding gene : W, where 

genc(cr) = enc(g(cr)) for cr G such that an abstract tile definition, which is a 
4-tuple of fc-tuples of symbols in A, is equivalent to a DNA tile. 

We define complementation for a fc-tuple in A^ as follows: let a = (cri ,... ,crk) 
be (oT, ...,oT) so that genc(u) = en.c{g(a)) = enc(g(a)) = enc(g(cr)). This 
makes the hybridization condition equivalent to having complementary symbols 
in fc-tuples for sides that will bind. 

The definition is purposefully broad in order to allow different algorithms to 
define the encoding based on the number of words and tiles needed. A 1-level 
generalization with A = 5 and 17 = W where g(s) = s and gene = enc is the 
original Rothemund-Winfree Tile Assembly Model. In this paper, we use the 
following model. 

Definition 7. The concatenation generalization is a fc-level generalization where 
W C 17* and g maps every combination of symbols in A* to a unique symbol in 
S. Partition S into S' and S' such that each set contains the complement sym¬ 
bols of the other, and S' S' = 0.^ For a G S', define genc( 5 “^(CT)) = enc(cr) = 
UI 1 UJ 2 ■ ■ ■ where enc(CTi) = G 17 and g~^{a) = (cti, (72,..., Cfe). Then for 
aGS', let enc(CT) = Wfc • WfcTY• • • ZUT, so that genc((;“^(CT)) = genc((;“i(cT)). 

In other words, the concatenation generalization model is a straightforward ex¬ 
tension of the tile model where each side of a tile corresponds to a fc-tuple of 
symbols, where the DNA sequence at the corresponding action site is simply 

^ The map enc as defined will be one-to-one, and so is g, and so this can be done 
since the DNA sequence corresponding to any symbol has a nnique complement, 
and therefore a unique complement symbol. 








the concatenation of the encodings of the individual symbols.^ Using this simple 
model, we can reduce the number of DNA words needed to |i7| from \S\^, and 
create simpler descriptions of our tiles. 


2.3 Algorithmic Procedures 

With the above models for tiles, we now discuss procedures for growing larger 
structures. 

We follow Rothemund and Winfree [Q and Markov ||^ and use the common 
self-assembly assumption that a structure begins with a seed tile and grows, at 
each timestep, by hybridization with another free-floating tilej^ The new tile 
hybridizes at a given position following one of two types of rules: 

Deterministic Given the surrounding tiles at that position, only one tile type, 
with specific sticky ends on the non-binding sides, can fit. 

Randomized Multiple tile types (with different sticky ends on the non-binding 
sides) could ht the position given the tiles present; a new action site is created 
with probability proportional to the concentration of its tile type in solution. 

Therefore, to grow a structure, an algorithm repeats steps until the struc¬ 
ture is complete: add tiles to solution; wait for them to adhere to the growing 
structure; optionally removes excess tiles from solution by “washing them away.” 
Cycling temperature during these steps to prevent or induce binding (based on 
threshold temperatures) can be done while waiting for hybridization, and is 
called temperature-sensitive binding. 


2.4 Complexity 

We consider five basic methods of analyzing algorithms using our model. 

Time complexity Each algorithm is a sequence of self-assembly steps described 
above, thus the natural measure of time complexity in our model is the num¬ 
ber of steps required (which describes laboratory time). 

Space complexity The number of distinct physical tile types (not the actual 
number of molecules produced) is space complexity. Introduced by Q, this 
describes the amount of unique DNA synthesis necessary. 

Alphabet size The number of DNA words, or \fl\ or |>V|, has a rough labo¬ 
ratory limit [^, and so the size of the symbol set used (lAI or |5|), which 
corresponds directly to the number of words, has practical significance. 

® Potentially, the concatenation model could cause interference among tiles. If we 
maintain no misbindings, however, our model removes this from analysis. In addition, 
it is theoretically possible to design DNA words so interference does not occur, 
depending on the algorithm. 

® In reality, multiple tiles can hybridize at once and structures consisting of more than 
one tile can hybridize to each other, but we lose no generality with the Markov 
assumption. 



Generalization level The generalization level is the amount of information on 
a side of a tile. This is related to the length of the sticky ends (and thus has 
biological consequences) and the number of actual DNA words (via |5|). 

Probability of misformation Misformed structures contain tiles that are not 
bound properly on all sides. Assuming the Markov model, consider adding 
tile r to a partial structure S. If complete hybridization requires binding 
on two sides, but T manages to hybridize only on one side (while the other 
action site does not match), S + T has a misformation. We quantify this 
probability with the following. 

Definition 8. Let the success probability at step t be the probability that a 
free-floating tile in solution binds at all possible sides to a partial structure 
at a given spot. (Step t is the addition of a tile to that spot on structure St, 
resulting in St+i-) This is 

Pr(S't+i is correct | St is correct) = ^ 

^all 

where A^correct is the number of tile types that can correctly bind, while 
is til® number of tile types in solution that could bind, possibly even 
incompletely. Call this qt- Then the misformation probability at step t is 
Pt = 1—qt- If the algorithm has k additions, then the misformation probability 
for the algorithm is 1 — q^qi ■ ■ ■ qu-i- Then an algorithm is misformation- 
proof if its misformation probability at every step is zero, yielding a zero 
total probability of misformation. 


3 Hollow Cube Algorithms 

In this section, we examine algorithms designed to use our model to build a 3D 
hollow cube using the folding technique shown in Fig. ^ Let the length of a side 
of the cube, n, be the input to the algorithms. We present the most interesting 
algorithm in detail and discuss some others briefly. 


3.1 Overview 


Figure ^(a) illustrates the planar shape our algorithms construct. We will refer¬ 
ence the labels and shading of regions in the figure during our discussion. 

As stated earlier in Sect. ^ we must make each shape unique so different 
partial structures in solution do not bind to and interfere with each other. Once 
we have a unique seed structure, we can then use self-assembly with basic rules 
to make the edges of the shape correspond so folding will occur. 

There are three basic self-assembly patterns used to construct different parts 
of the shape. 


Random assembly Implements a random rule (see Sect. 2.3). Formally, add 
all tiles in a set of distinct tiles R in equal concentrations so each could 
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Fig. 3. (a) Regions of the 2D planar shape, (b) A straight-copy pattern, (c) 
A turn-copy pattern. 


potentially hybridize completely at a given position. Thus the information 
at that position is completely random. The tiles differ by a component of 
their exposed fc-tuples, assuming a fc-level generalization. 

Straight copy See Figure ^(b). Tiles are added to copy the pattern along one 
end of a region through to a parallel end of an adjacent region being con¬ 
structed. This rule is deterministic. 

Turn copy See Figure ^(c). Tiles are added to copy the pattern along one end 
of a region to a perpendicular end of an adjacent region being constructed. 
Counters will be required to position the tiles appropriately to complete this 
deterministic rule. 

The algorithm begins by assembling a random pattern string that will be copied 
to the top and bottom of the box. Then random patterns are added for the 
remaining edges of the box, and these are copied to the corresponding edges 
accordingly. (Refer to Fig. || for corresponding edges.) Finally, the shape that 
will fold is cut out by raising the temperature, assuming that the bonds between 
tiles along the region-borderline have weak threshold temperatures. The regions 
that are shaded in Fig. ^(a) are cut away. 

3.2 Notation 

All of our algorithms will use a 3-level concatenation generalization model; thus 
a tile is a 4-tuple of triplets that we write x Ts x Tw x Te, with each 
Ti = (cti, (72, (T3). We change the order of the tuple to more easily identify tiles 
that will bind, since most binding will be north-south or west-east. (This decision 
is arbitrary and purely notational.) We assume all tiles are oriented so that the 
directions are clear. 

We define the set 7T C A to be the “random patterns” tti , 712 ,..., TTp used as 
components of exposed triplets for tiles used in random assembly; their use will 
become clear when we discuss implementation of random assembly below. 

We use counters to control the growth of of our planar shape. This concept 
has been well explored in and earlier papers. Each tile can be assigned a 
position in the plane denoted by a horizontal and vertical coordinate. We create 
symbols for position counters and then allow tiles to hybridize if the positions 



















match, creating a mechanism for algorithms to place tiles in absolute or relative 
positions. Let H{i) and V{j) be the symbols denoting horizontal position i and 
vertical position j, respectively. 

3.3 Row-by-row Algorithm 

The row-by-row algorithm sacrihces time complexity to reduce space complexity 
and alphabet size. In addition, by using more steps and temperature-sensitive 
binding, the algorithm eliminates the possibility of misformations. 


Implementation of Random Assembly The base strip shown in Fig. ^(a) 
is created via random assembly and represents the unique seed structure for the 
shape. The pattern assembled on the strip will be copied to other edges. The 
strip will have length n as it is an edge of the cube, and so we use horizontal 
counters to control growth. We add the following tiles to solution: 


(/3,i?(o).y(o)) X {0,1m, v(^) X (p',ff(o),y(o)) x (7cr.i?(TT. vW) (i) 

X (;a,H(oy.V(-l)) x (Ki,ff(n-l),y(0)) x (p'.ff(O).V(O)) (2) 

(a,K2,V(0)) X (q,7cJ, y(-l)) X (Ki,ff(l),y(0)) X (w,H(2),V{0)'j (3) 

{a,K2,V{0)) X (cf.-Kl. y(-l)) X (Ki,ff(n-2),y(0)) X (w, H(n - 1), y(0)) (4) 

(^ic.K 2 .y(o)) X y(-i)) X (Ki,Hii),v(o)) X ( 7 tr.ff(*-i- i).y(o)) ( 5 ) 


where the tiles in (|^) vary over all i, 2 < i < n — 2, and all fc, 1 < fc < |7T|. 
Since the ends, given by (P and (P, the tiles bordering the ends, given by (P 
and (jj), and the pattern tiles in the middle of the strip, given by (H), all have 
appropriate horizontal counter markings H{i) for every position 0, ...,n — 1, 
every base strip must have the appropriate ends and length n. Note that the 
pattern on the exposed north and south side of tiles in position 2,... ,n — 2 
given by (P, is completely random as any one of the tiles containing some tti 
could hybridize there. Thus, this shape now has a unique identiher, namely, the 
sequence of patterns {iVi} along the base strip; the probability of any sequence 
expressed on a given shape is |7T|"“^. 


Implementation of Straight Copy We must complete a straight-copy from 
the base strip through regions A,B,C, and D, in addition to straight-copy pat¬ 
terns required elsewhere. This is done using 2n + 1 steps as the tiles for A and 
C are done in parallel (the same for B and D). In the following example for a 
straight-copy through A-D, assume that the triplet constant X = {ipi, (p 2 , ^ 3 ). 
The following tiles are added for each step i, 1 < i < 2n — 2: 


(a, K. 2 ,V{i)) X ^a, K 2 ,V{i — 1)^ X (<^1 , <72 , V(i)) X X 
(a, K 2 ,V{i)) X {a,1^2, — 1 }) X X X (ai, a 2 , V (i)) 


( 6 ) 

(7) 











{a,K2,V (— i)) X Ki', i — 1)) X (<Ti ,cr2,V (—i)) X X (8) 

(a,K, 2 ,V (—i)) X y(—i — 1)) xX X (cri , (T 2 ,V( — i)) (9) 

(TTfe. K2, X(i)) X (^,K2, X(i - 1)) X X X X (10) 

(TTfc, K2, — 0) X (w. x( — i - 1)^ X X X X (11) 


Tiles versions exist for all patterns in 77 when TTfc is present in the tile description 
above. We assume that X binds weaker than K 2 , so cycling the temperature 
ensures the tiles are attached on the K 2 side. 

The tiles are added sequentially to prevent misformations. It is evident that 
the borders of these regions, given by §-§) have symbols on the exposed sides 
to hybridize with a strip of tiles that will form a folding edge of the box, and 
the middle tiles copy the random pattern to another location. 

After 2n — 2 of these steps, the middle regions are complete, except for two 
rows on top (rows 2n — 1 and 2n) and one row on the bottom (2n — 1). The 
order of these rows is important, as we add an extra set of tiles to prevent the 
top and bottom from hybridizing before the folding is complete. 


Implementation of Turn Copy The turn-copy step, for example, copying the 
bottom edge of E to the left edge of D through I so the shape can fold, is done 
using vertical and horizontal counters, which essentially places a tile in a specific 
spot. Therefore we can add all the tiles at once to complete the region without 
possibility of misformation. For the above example region, we would add the 
following tiles. Let i and j vary such that —n < i < —1 and —2n -I- 1 < j < —n. 
For all i,j, add: 

i = j + V{j))x H(i), V{j - 1)) x(K 3 ,H{i), V{j)) X (tt)", H(i - 1), V(j)'j (12) 

i < j + in-1), (Trk,H{i), V(j)) X (jfk , H(i), V{j - 1)) x(k 3 , ff(z), V{j))x (k?, H(i - 1), V(j)^ (13) 
i > j+in-i), (k 3 , H{i), V(j)) X (k?, H{i),V{j - 1)) x{'Wk,H{i), V(j)) X (^, H(i - 1), V(j)'j (14) 

The above copies the pattern through I, and the following adds the left edge to 
D, where —2n -I- 1 < 7 < —n — 1: 

(/3, H{0), V(—n)) X (jis, H{0), V{—n — 1)^ x (tt;,, H{0),V{-n)) x (oT, 5^, X(-ri)^ 
(K3,ff(0), V(i)) X - 1)) X {■Kk,H{0),Vit)) X (5T.5^.X(i)) 


As an extra precaution we can set K 3 binding to be weaker than tt^, and cycle 
the temperature several times. In addition, we force the encoding of horizontal 
and vertical counters at the edges where the folding occurs to be the same, so 
that the sticky ends are in fact complementary. 















Summary After random assembly of the base strip, we use a row-by-row 
straight copy using a one-dimensional counter through regions A-D to copy the 
base strip’s pattern. We then use straight copy to fill in the bodies of regions 
E and F and add the edges using random assembly, as these will correspond 
to other portions of the shape. We then use a turn copy through G-J to make 
those edges correspond. Finally, we do a sequence of straight and turn copies 
from E and F through K-ISI and 0-R to complete the shape. We then raise the 
temperature to cut away the shaded regions. 


Analysis of the Row-by-row Algorithm 
Theorem 9. | Aj = 8n -I- |i7| -I- 0(1). 

Proof. The symbol set S consists of the following: pattern symbols in 77; 4n 
horizontal and 4n vertical counters for positions throughout the shape; and some 
specialized symbols, not dependent on the inputs. □ 

Theorem 10. The algorithm has has time complexity approximately 5n. 

Proof. The algorithm consists of the following steps: 1 step to build the base 
strip; 2n-|-l steps to complete A-D; n steps to grow E and F; 1 step to add random 
patterns to the leaf borders; 1 step for the first set of turn-copy steps (G,I,H,J); 
and 2n -I- 2 steps to complete regions K-ISI and 0-R. The time complexity for the 
basic version, therefore, is approximately 5n steps. □ 


Theorem 11. The space complexity of the row-by-row algorithm is approxi¬ 
mately 6|7T|n^ -I- 10|7T|n -|- 4|77| -|- 8n tiles. 

Proof. Clear from counting the number of tiles required, including all the nec¬ 
essary variations of the same tile for position and pattern. □ 


Theorem 12. The number of distinct temperatures reguired is 3. 

Proof. One temperature is required for detaching the excess portions of the 2D 
shape, and the highest temperature level is for the main portion of the box that 
remains intact. Adding tiles in rows involves cycling the temperature to prevent 
misformations, by ensuring the “random-patterned” side of the tiles bind. This 
requires a third temperature between the two discussed where potential misfor¬ 
mations denature. □ 


Theorem 13. The misformation probability of row-by-row is 0. 



Proof. The random assembly steps grow the shape in one direction using a one¬ 
dimensional counter, and only the correct tiles are marked with the appropriate 
counters. Thus the probability for these steps is zero. The turn-copy steps use 
both vertical and horizontal counters dehning a two-dimensional position that 
bind weaker than the random patterns. As these are the only three words on 
tiles in this pattern, these also have a misformation probability of zero. Finally, 
because the straight-copy steps are performed in a sequence and all the potential 
spots for misformation are sealed with tiles one row at a time, the probability 
of misformation here is also zero. Thus the misformation probability of the al¬ 
gorithm is 0. □ 


3.4 All-together Algorithm 

The row-by-row algorithm does not take full advantage of the parallelism of 
self-assembly. Instead of adding one row after another, we can synthesize tiles 
with both horizontal and vertical counters, so each tile type can fit at only one 
absolute position. Then we can begin with virtually all our tiles in solution and 
allow the assembly to proceed in constant time. 

Random assembly is performed as above, with special one-dimensional coun¬ 
ters to control length. However, these strips must be assembled separately and 
independently to prevent interference. The sequences exposed opposite the ran¬ 
dom patterns will dictate where these strips can later hybridize on the larger 
shape, and the random patterns will cause growth that makes the shape unique, 
as needed. The turn copy implementation is unchanged, as counters are already 
present. The straight copy, rather than using a “constant” symbol, must instead 
use counters for growth in both directions. 

Although this algorithm is straightforward, with all the tiles in solution at 
once, we might get incomplete partial structures. Specihcally, tiles can bind on 
sides that have counters but do not have any tt^ component, thus creating shapes 
with no information allowing them to fold. In this case, at any of the 0{n?) 
positions, a misformation could occur if the counters bind but the patterns do 
not, and so the misformation probability of all-together is 1 — . We can 

prevent this by requiring the counters to bind weaker than the and cycle the 
temperature (requiring 3, rather than 2, distinct temperature levels), giving a 
misformation probability of 0, but practically, this has little meaning since we 
may still get incomplete, partial structures and may have to wait a long time for 
proper hybridization to occur. 

Besides this drawback, this algorithm performs well by other complexity mea¬ 
sures. Since counters are necessary for the row-by-row algorithm, the alphabet 
size is not greatly increased. The algorithm has space complexity 0(n^ |7T|), es¬ 
sentially because a different tile type is created for every position and every 
pattern that could be present there. 



3.5 Other Algorithms 

We have considered several other algorithms using the same 3-level generalization 
model, experimenting with different uses or omissions of counters and other 
implementations of the self-assembly patterns. 

The by-region algorithm keeps space complexity and time complexity low by 
removing counters and controlling growth in only certain rows and columns of 
a region. The probability of misformation is increased. 

The border-first algorithm has the highest misformation probability, although 
has a very low time and space complexity. In this algorithm, the frame of re¬ 
gions are constructed first, when possible, and then “filler tiles” are added in 
to strengthen the structure later. There are potential stability problems during 
shape growth. 

Finally, another possibility is to build the six faces separately and allow them 
to hybridize, but with this method there is little control over how the final shape 
actually forms in solution. 

4 Conclusion 

Our paper introduces a precise extension to the Tile Assembly Model |0 that 
allows greater information content per tile and scalability to three dimensions. 
The model better formalizes the abstraction of DNA tiles to symbols, intro¬ 
duces five complexity measures to analyze algorithms, and is the first to extend 
nanostructure fabrication to three dimensions. 

In addition, our paper opens up wide-ranging avenues of research. 

First of all, it may be possible to encode information on tiles more succintly 
than our algorithms do to accomplish the copy patterns discussed. The existence 
of good 2-level or 1-level generalization algorithms is unknown. 

Algorithms to form other 3D structures, having various applications in biol¬ 
ogy and computation, can be studied. More work also must be done to quantify 
the probabilities specified in the paper (possibly including a free-energy analysis 
of tile binding). 

Finally, there remain some important biological issues. In particular, design of 
a strong tile suitable for our method of computation and design of a 3D building 
block are two important steps to increasing the feasibility of 3D self-assembly. 
The use of temperature may be further refined and exploited to improve some 
complexity results and the number of steps needed in the lab. 
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